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ABSTRACT 


The mathematics program of an elementary school was 
changed so that the classes were partitioned into smaller 
sections of comparable ability. This change reduced the 
miai-teacher ratio and the length of the instruction 
period. The proposal produced an unanswered question: 
Met there be an increased achievement gain during the 
Semool year which can be attributed to the program change? 
imgemschiools served as control groups for the experiment. 
Tae) California Achievement Test, 1970 Edition, was used 
in a pretest-posttest design to measure achievement levels 


berore and after the elapsed time of the experiment. The 


* ~~ + 1 (f 
Guestios fa i 


1 . 77 ce 
Gor bd bd 4 = . * = a 
PSeeRNct oo) CUlicoie Le tt RTS AEE ZS 


Cb 
Cn 
@ 


. 8 
Ld 


lado 


Various statistical techniques: randomized matched subjects 
G@eeon, analysis of cowariance and the non-parametric Mann- 
Whitney U test. The several techniques were applied since 


no single standard practice was available for this problen. 
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I. INTRODUCTION 


On September 15, 1970, the author attended a PTA meeting 
ae his child's school. The primary purposemomeeic mecca: 
was to discuss a propcsal for a change in the mathematics 
meseruction at the second grade level and to obtain the san 
mmeval of the parents involved. 

The proposal was primarily directed at the large pupil- 
Meecher ratios, approximately 30 to 1, existing in the five 
second grade classrooms. Essentially, tne pupils in the 
eeeond grade would be divided into ten Sections (vice five 
O@imsseS), with five of the sections being taught in the morn- 


mieeand five in the afternoon. The same five classroom 
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ratio would be virtually halved by the proposed change. Also 
emeesiiailer sections were to be composed of pupils of con- 
wamepie ability. The parents were unaware of this aspect of 
the proposal. | 

A scheduling problem was inherent in the proposal due 
to the morning and afternoon sessions and the limited number 
of teaching hours available during the day. This problem was 
eemved by having the second graders arrive and depart the 
School at times different from the normal school routine and 
by reducing the normal 40 minute math period to 30 minutes 
in the morning session and to 35 minutes in the Sete neon 
session. These scheduling changes allowed the teachers to 


Semain within their ala@licted teaching hours. 








The main point of concern to thes pancnGsmeemeoncm 
around the possible inconvenience caused by the different 
arrival and departure times of the second graders involved. 
In the discussion that followed it was implicitly assumed 
imate the achievement level of Ghe pupilssyould be Duchom 
under the proposed change; this assumption was never ques- 
tioned. The primary reason for the assumption was the re- 
duced pupil-teacher ratio made possible by the increased 
mumtber Of sections. The reduction of the math periods by 
ten minutes in the morning session and five minutes in the 
afternoon session was not felt to be significant by compari- 
son. 

The proposal was adapted and implemented the following 
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considered. The objective of this thesis, then, is to de- 
Domamine if there was a Significant increase im mathematical 
fomerty of the second graders anvolved which could be at- 
muaeieced to the change in mathematics instruction. 

Measuring and evaluating growth objectively is a clas- 
See problem in psychometrics. It has been debated exten- 
Sively in the literature by Lord [Ref. 1, 2, and 3], McNemar 
[Ref. 4], Cronbach and ENeGESE [Rese S$] and others. Reference 
6 is a collection of papers all dealing with the same problem 
mmenance: hew to measure andyevaluate it. Cronbach and) Furby 
have even gone so far as to suggest that measurement cf 
change should not be done. They do present a model for anaé- 


lyzing individual change which they consider b tter than 








other available models, but they recommend stiiawee te erecuila 
be interpreted with caution. The purpose here is neither to 
add nor subtract from the sometimes analytical, sometimes 
paalosophical, arguments found in them@l@terature but compcc. 
ceed with models that have been used in the past to measure 
and evaluate change. 

It was probable that mathematical skills would increase 
over the school year with or without the implementation of 
the proposal. This created a problem in isolating that por- 
Mmiomeor achievement, if any, due to the adopted proposal. 
ies and Other extraneous factors led to the selection of a 
eencrol group. Iwo requirements were considered necessary 
mere che control group. The first was that the pupils be 
meeeeonoGa to classes’ in the normally uccepted maine r@end 
mime Classes not be divided into smaller sections. The sec- 
Sma requirement was that the pupils in the control group be 
Of comparable ability with those in the experimental group. 
The first requirement seemed easy enough to satisfy, aut 
the second seemed very difficult without prior knowledge of 
mathematical ability of both groups. 

PeerObIeClive Measurime devree Was also meeded toumed- 
eee mathematical ability. The most commonly accepted way 
of measuring knowledge in educational institutions is by 
Mmemese Of tests, be they written, oral, or other. It was 
decided to give a written pretest and posttest to both the 
Esperimental and control groups. It was felt that this DGe> 
Mest-posttest design would alleviate some of the aifficul- 


Pres in meeting the Seconda requirement mentioned above. 
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Fortunately, the author wasereferredsto ClG, Mena. 
Hill, Monterey, Califcymia, Sy his thesissadyisen. se bouelcs 
J. McRae, of the research department ofthat stun.) tec anal 
miterest in thisproject quupon hasmuncie eee 111m a emecd 
memturmish their Califormia Achiévement Pests ro] 0  beienenm 
to the schools involved. Consultation with Mr. McRae and 
mre principal of the e€xpeérimental school aiso led to the 
memeection of a control group. 

It 1S widely accepted that a positive correlation 
exists between socio-economic level and academic achievement. 
It was, then, on the basis of assumed socio-economic level 
that the experimental and control groups were deemed to be 
Sepcomparable ability. <A questionnaire furnished by Mr. 
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Mewels of the two groups. 

wie scxperimental group happened» to be amethe lanrgecr 
emementary Ssehool in the area, so two schools were chosen 
Eemerorm the control group. This allowed the iuneéricaleeoue 
ere Of the experimental and control groups to be of the 
same magnitude and the results to have a broader base. The 
peamicipals of the two schools tenatively selected as the 
Gemtrol group were contacted and both were eager to coop- 
eee In the experiment. The experimental school was la- 


Mmemed school A and the control schools were labeled B and C. 
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II. PUPIL-SCHOOL BACKGROUND 


A School Characteristics Questionnaire [Appendix A], 
furnished by Douglas J. McRae of CTB/McGraw-Hill, was given 
mamcthe principal of @ach school tomeonpmerermeine ducstiacn. 
Maire requested information on a variety of pupil, stafé, 
amd physical plant characteristics. The schools were all 
@omsidered to be located in residential suburbs. “ihe per- 
mement results of the questionnaire are presented in Table 
I. The number in parenthesis refers to the questionnaire 


item. 


(c 
UESTION AREA 





Mobility ic) 35% 10% IL 
PTA Attendance (4) 20% 20% 10% 
Employed Mothers (6) 5 25% 40% 
Per Cent White (10) 08% 95° 05% 
Kindergarten (a3) 75% 100% 100% 
One Parent (14) a2 % 10% 
Englash Second (15) 43 $ I 
Professional (Clie) 75% 50% 45% 
Mwiate Collar (on) 15% 30% 37% 
Siailed (ek, 10% 15% fog 
Unskilled AL's) 0% a 3% 
PHYSICAL PLANT AND 
ADMINISTRATIVE: 

Plant Age (3) AVES 5 yrs 20 yrs 
New Programs (8) 2-o8mes 2-6 mos 2-6 mos 
Library (17) 7650 - 9000 
meee Fr CHARACTERISTICS: 

Emincipal's Salary (11) 17,145 $17,500 $16,500 
myerape Start Salary (EZ) ff ASO 7 00 7.500 
Average Salary (19) 9,850 - 10,000 
Pocrage Experience (18) 1 V¥ears (35 Beaus oa cars 


feeble RESULTS OF SCHOOL CHARACTERISTICS QUESTIONNATIEE. 
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It 18 apparent that the schools eai fierce ue ya 
areas (€.¢., mobility, emplicyed mothers sand ethe mune ioes 
ene parent homes). There 1S an apparent positive corm. 
mon between the per cent of workine mothers amd thesper 
eomue OL homes with one parent. The experimental schoo 
mad a lower percentage of pupils who had attended kinder- 
Mmearten. This agreed with the higher mobility fisures for 
mime SChool, as it is plausible that a large percentage 
®t the pupils came from a school district outside of 
@meycoria. where kindergarten 1s not required. Two of 
me schools (A and C) have considerably older physical 
memts, but all three schools maintain a clean, neat ap- 
Mearance conducive to learning. 


ae 


im Oti 


r¢?) 


y areas the schools were vcory simgiar. They 
hemomnomogeneous racially; at ieast 95 per cent of the 
plies trom each school were white. They were all members 
Oietne same school district; hence they had the same mini- 
mum math program [Appendix B]. The average experience of 
baie teaching staff was between seven and nine ycars. At 
east 80 per cent of the pupil's parentsywere in white- 
eolmar or professional occupations. From this it can be 
meumed that the socio-econcmic level of the three schools 
was approximately the same, partially confirming the basis 


on which the control schools were chosen. 








III. ASSIGNMENT TO CLASSES AND INSTRUCTIONAL LEVELS 


The school district had sep store Stavesent so. eee 
mum goals and skills to be achieved by the second grade in 
mathematics [Appendix B]. All three schools involved in 
mac experiment used Modern School Mathematics - Structure 
amd Use, published by the California State Department of 
memreation, as the basic text. This text adequately covers 
mre Minimum requirements and exceeds them, continuing into 
Ime taiptication. More advanced texts are available for the 
meee ced Child, who is allowed to proceed at an acceleratéd 
meee The advanced texts extend the concepts covered in 
mmemoasic text and introduce the pupil to division. 

in etme SXpCTimemtdi vrelp, wuciis were into ee 
emomed towne of ten mathematics sections on the basis of 
meaaing performance and a departmental math test. Some as- 
Signments were changed at two different times during the 
menool year. The first shift occurred after the are tee 
was given. The pupils who scored way above the average for 
miecmresection were shifted to am appropriace section. No 
pupils were transferred to a lower section on the basis of 
ime pretest. 

The second assignment change occurred during the mid- 
dle of the school year. Several pupils transferred to and 
mem the school. Persons in authority felt that the new 
pupiis should not be placed in advance sections; not enough 


information was available for basing the higher assignment. 








Hence new students were placed in the lower sections. The 
better pupils in the lower sections weremaiei ted seoead- 
vVanced sections to keep the pupil-teacher 7ami0s conmea7- 
able. It was felt that by shifting pupils in this manner 
no one would be in a section in which the material presented 
was beyond his capability. Unfortunately, the records as 
@emwhich pupils were shifted are not complete. This Ved to 
a difficulty in interpreting the data, and the question is 
Baasessed later in the thesis. 

Five ot Une Sections weme) tawent during the morning 
micerive during the afternoon. This division of the five 
Seeond grade classes into ten sections automatically yxe- 


G@meed the pupil-teacher ratio and, as noted previously, 


This sectional assignment by ability allcwed the teacher 
mempresent instruction at the average achievement level of 
their section rather than the average level of the second 
grade taken as a whole. | 
basteally, theré were five levels of amstruction at 
the experimental school. Level I was taught at an advanced 
mere! to include multiplication and division. Levels II 
ameelil were taught at progressively lower levels. Level 
IV was considered to be composed of pupils who were capable 
of learning at the normal second grade ievel. This group 
was introduced to multiplication near the end of ane acho! 
year. Level V consisted of the under-achievers. They were 
maucht at the jeeond cuadcebevel but with a more repetitious 


imearling process and extensive drixsls in fundamentals. 
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In the control schools, levels were not used. Children 
were assigned to class more or Jess at random; there was no 
distinction made between slew amd fast achwevers: Burane 
the school year the level of instruction was based upon the 
average lewel of achievement within the class. The teachers 
gave special assistance to those children who were lagging 
and to those who were capable of progressing more rapidly. 
ies assistance allowed the pupils in the control sehoots 
to have essentially the same range of instruction as that 
mee Pupils in the experiment group. 

The division of the five classrooms in the experimentai 
school into ten sections formed the basis for the experi- 
ment. These sections allowed for five different levels of 
Meer ructiOn and a reduced pupil-teacher ratzwevct £5 tea 
mone levels I, II, I11, and IV and 10 to 1 forw@evel V. 
limes) iS to be contrasted with the control group schools 
witerc pupils were assigned more or less at random to a class, 
omesgeneral level of instruction prevailed, and the aaente 
meaenecr ratios were approximately Z5 to 1 and 30 to 1 in 
Senools B and C respectively. Further, the pupils in the 
egsperimental school were being taught five or ten minutes 


ies per day’ than the pupils in the control schools. 


1 
Ten minutes for the merning classes and Tive minutes 
fer the arternoon classes. 
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IV. THE CALI FORNDAVACHIEVEMEN Tate oe 


| em mm ee ee Se ee eS 


The experimental and control groups participating in 
ehe experiment and the Gxperimene 21tselm have becnedese wry a. 
Information was presented which indicates that the pupils 
involved were of the same socio-economic level. Presumably, 
@ien, the pupils of the different schools were, on the aver- 
mies, at the same level of learning at the start of their 
See@end grade year. It is recognized, however, that the 
maturity level of second graders can range from kindergarten 
memene third or fourth grade, i.e€., it was not assumed that 
peeeepupiils had reached the same level of academic achieve- 
ment. 
one would expect the same magnitude of achievement in math- 
ematical skills during the second grade school year for 
both groups, if there were no real difference between the 
math programs of the experimental and control groups. 

As mentioned previously, the California Achievement 
ieee 1970 Edition, was the measuring device used to measure 
the achievement level at the beginning and end of the exper- 
iment. The test is designed for the measurement, evaluation 
ama analysis of school achievement in the three basic skill 
areas: reading, mathematics, and language. 

These three skill areas are measured separately. The 
meen section of the CAT measures [Ref. 7]: 

"-the ability to understand the meaning of the material 


Crescneca. 
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-the performance of the studenten eapp ly oie eeoece 

facts, concepts. conventions, Sand. pri neimal men 

problem. solving amethe basic GiWri4a cular ase 

me level of performance Of there tidens anism 

the tools of mathematics in progressively more 

difficult Ssituatiaens.. 
How well does the CAT measure mathematical achievement? 
This question is partially answered by Reference 8. Further 
information, based upon national standardization data, will 
be available when the Technical Repcrt, California Achieve- 
fee Test, 1970 Edition, [Ref. 9], is published. The author 
has had limited access to this material through Mr. McRae, 
mi trom all reports the test is well established. 

Reference 7 contains tables to assist in transforming 

Taw scores (total number of right responses) into Achieve- 
Meme Development Scaie Scores {ADSS). 
were produced by applying the Thrustones absolute scaling 
pmemeedure to the standardization data [Ref. 8 and 16]. 
There are five levels of the CAT designed to cover grades 
eme through twelve. The scale scores have the aay aineece 
Meee articulated across all ievels of the CAT, 1.¢7, 
piegretically a first graders achievement can be compared 
fees that of a student in high school. Further advantages 
@f scale scores over raw scores are that they have interval 
ele properties and are normaily distributed [Ref. 8]. 
emcee, these scores are advantageous for use in statistical 
analysis of the measurement of growth, trends, etc. 


There are two levels of the CAT which pertain to the 


eecond pradée. Level 1 is designed to test pupils im the 


1 as 








first and second grade. level Zecovems sehessecondmemieian 
fourth grace, thus overlapping levels) at the secondee i414) 
It was decided to give level 1 at the beginning of the school 
weanweand lievel 2 at the end et the s@ehool year neomacacon 
two levels were chosen instead of using level 1 twice, was 
memmallow for maximum Spread of the scoresmever the legume 
Sameera and to avoid practice effects, ‘Tnterlevel arracu a. 
fees a correlation measure o£ how well the different 
i[owelsS measure the same attribute (in this case, mathemati- 
eat achievement) at the overlap of the levels. The interlev- 
fea rticulation coefficient for levels I amd 2, CAT, 1970 
Pamermon, is .7/8 [Ref. 9]. 

Feveleleot the CAbwwasmerven CoO all epart ie iene 


a “ 
Cwezres 2 ra 
Ww Vv os (*) 


ted 


schools during the second week of Octcber, i970. 
Wereeoiven during the third week of May, 1971, an elapsed 
time of seven months. The tests were given, monitored 

ama scored by the schools in accordance with Reference /. 
The raw scores were collected by the principals of cae 
Marticipating schools and given to the author within a rea- 
Eomaole period of time after each testing period. 


Bach level of the GAT mathematics section is divided 


ieo two parts: concepts and computational ability. The 
MmemeccOresSw@t Cach area were kept separate. The author 
Gummerted the raw scores into seale scores. These scores 


moe all pupils are contained in Appendix C. 
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VV. METHODS OPS ANA es 


The stated objective of this thesis was to measure the 
achievement gain in mathematical skills of three different 
jets Of pupils, Gompare these measwres ind detcrmine =... 

Some Means whether any one of these measures 1S Signiiireant- 
Jy different from the others. As noted in the Introduction, 
a standard model for answering this question has not been 
abtieained to everyone's satisfaction. 

Memece, due to a lack of a totally defensible matherac1- 
@at model, several statistical techniques were applied to 
the data. This use of several techniques had the advantage 
of compensating for weaknesses in different procedures and 


a arn tN San ES PLAN A Sos ~ S x 4 Sea ge eScnesg pe es as Ses & : Ee eer ate 
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hy 


Oo 
Pemurally the results would be highly correlated and the re- 
feercing value of a second or third successful procedure 1s 
Slight (one should not perform multiple tests on a single 
Semnor data). Joint probability statements cannot be made, 
Only individual {or marginal) ones. It was still held to 

be valuable to have the several tests as backups for one 
another. The three models used in the analysis were the 
Randomized Matched Subject Design, the Analysis of Co- 
Variance, and the non-parametric Mann-Whitney U Test. The 
Mmeecder 1S asked to judge for himself as to which technique 


eupports the experimental paradiom. 


A. RANDOMIZED MATCHED SUBJECT DESIGN [{Ref. 11 and 12] 
In this teés@ there wmust be an equal number of pupils in 


the experimental and control group#. Schools B and C were 


1a) 








combined to form a control group oF Plo ssub ject cuecmm, 
had 94. Random number tables were used to discard the 19 
excess pupils from thé control eroup. After ero precece 
wasegiven, the pupils were ranked @waithin @he expemmmentaL 
ama control groups on the basis of their pretest score. 

lime two groups were then paired by placing the pupil scer- 
ing highest in the experimental group with the pupil scor- 
mieenignest in the control group, etc., until 94 pairings 
Meme Obtained. After the posttest was given and scored a 


meererence score was obtained on each pair in the following 


yawn er : 
Dy re ee ee 
1 1 il 1 
where 
Xo. = pretest of ith pupil in the control group 
1 
ae ~POStcest (ot ith pupil in the control group 
1 
Xe ST imeetes tO: jth pupil An “she experineneal wor oup 
tL 
Yr. = posttest of jth Olpawewae tive -oXDC riive l) Ga lsseaee moe 
1 


With a difference score thus defined, the question of 
Pierner Or not the program of the experimental group was 
emter than that of the combined control group can be ad- 


dressed with the following statistical hypotheses: 


He uy = 0 
Hy: yu > 0 
Where Ww is the mean of the difference scores. If during the 


analysis, Ay the null hypethesis, 15 not rejected, this a a 


20 








imply that no significant difference existseri ticmp noc 
of the experimental and control groups. It was assumed 

Eat the difference scores had a normal distributions, oon 

a mean of zero and an unknown variance. Under thesemas- 
semptions, the Student-t distribution yields the junitenn:, 
most powerful test against all one-sided alternative hypoth- 
eses. It was used in this thesis to test the null hypothe- 
eo presented above. 

The randomized matched subjects design has a distinct 
wireamease over a design where the difference between the 
estimated gains made by the experimental and control groups 
memmaynoorthnesized to be zero. The pairing on the pretest 
Seeres introduces a high correlation between the groups on 


ceranraec TR Ss mm manana +-aAn dacranancac ie actamata 
i Ney Aas as © ee ae eee toh h Ped ak wages ioe) Bie Oe NG cae A eo Pi weihw DO cM mie, we 


posttes 


meme Variance by an amotint equal to twice the covariance 
Beene means. Thus the randomized matched subjects design 
meammore Sensitive to differences between the two groups 
mitmea deSifn where pairing is not done, since the denomina- 
memeeor the critical ratio is reduced. 

ine wrimany disadvantage of the randomized matched sup- 
Hemme GeSign is that an implicit assumption is made, 1.e., 
mmemscores Obtained on the protests and posttests are true 
weeres. In reality the obtained score on any test can be 
mmought of as a true oe DPlUGeCTT Or a Wwhe Tew e110 ime ai 
be modeled as a normally distributed random aneteble vac 


zero mean and unknown variance. The difference score for 


een pupil, as defined, 1s the difference of two normally 


Zu 








distributed random variables. The sum of these differences 
has a normal distribution but the variance may change from 
pupil to pupil. Thus the Student-£ procedure 11sec jn 
a variance which is actually a composite of each students 
Variance rather than the value common to all. Hence this 


procedure for comparing growth may »e questioned. 


Be ANALYSIS OF COVARIANCE [Ref. 13] 

The second method used in the analysis of the different 
mathematics program was the Analysis of Covariance. [In the 
Bm@elysis the posttest score is considered to be dependent 
moon the pretest score, which 1s called the concomitant 
variable. The Analysis of Covariance takes advantage of 
mrewintoOrmation furnished by the concomitant variable in a 
fem@ec that the posttest scores aré “corrected” by taking 
mimoeaccount the differences between the groups on the 
Semeomitant variable. After the posttest scores are “cor- 
meetea' the procedure reduces to a straight analysis of 
Variance. Thus the results from the analysis of covariance 
MemenaccUurately reflect any differences that are due to the 
Gmemerent programs than a straight analysis of variance 
memea On the posttest scores. 

The assumptions of the analysis of covariance are: 

1. <A random sample of size one is drawn from each of 
N populations. 

2. Each of the populations is normal and has the same 


wmariance. 


Z7. 








3. The population means within each group lie on a 
straight line and the slope of the line is the same for each 
group. 

Emmce the concomitant variable 1s measured with errors 
the obtained scores are not necessarily true scores) it 
must also be assumed that the assumptions hold for all pos- 
Sible values of the concomitant variable. 


tne model tor the analysis ©f GoOvaliance Can Dewwere- 


em as: 
ij oe oe oe Dees og) 3 ie Belo es a 
= ee ree 
ge 2 N(0,07) 
i ie 
% jg = 2 a 
where 
im - the mumber of schools. 
ote = the number of pupils ain the jth Se ool, 
ij = pretest of jth Buna baie se jth school. 
Yi; = posttest of ith pupil in the jt® school. 
B SRO nalieet 1ecu on jth school. 
Y = slope of regression line (posttest on pretest). 
m= the average of ail r population means. 


Mire Objective Of the analysis 15 toe choose between the 


Poeplowing set of hypotheses: 


an, 


nn ee j=1,2,.--5T- 


Hy: ROtrall Beare tne wsame 


Zo 








The null hypothesis implies that the corrected program ef- 
fects sare abiweaquaie 

The robustness ©2f €his tésSt 1S sitet sac ewe iene yeas 
piatwor the strazent analysis Of yariamee, — bel dima ieee 
Mia Paper comparing several techniques) Statea thats mic 
number of assumptions required for a valid analysis of co- 
Variance renders the technique generally less applicable 
m@man Other statistical tests used for the same purpose. 

The failure of the data to meet the assumptions is thought 
mMembe more serious in analysis of covariance than in straight 
analysis of variance (the Student-t in the two sample case) 
especially with failure to satisfy regression assumptions. 

He ldt investigated the preeision Of a Pactormatadecacm 
mmmen the analysis cf covarvemce. He digmovered thet for 
regression correlations of greater than 0.6 analysis of co- 
Weampence has the greater precision. Since CTB/McGraw-Hill 
Meemestimated this correlation (their interlevel of articulation) 
membe 0.787 in their standardization procedures, it was 
bemerevyed that the analysis of covariance in this thesis 


would yield valid results. 


C. MANN-WHITNEY U TEST 
A third test was presented for those readers unwilling 
to accept the assumptions in the analysis of covariance or 


ifesvalidity of the randomized matched subjects design. It 


2 
Scecmtie dt valiccemmn Tables = ix and Figure | “tom wales 
obtained during the analysis. 


24 








is the non-parametric Mann-Whitney US tes tds ee one 
was again necessary to form the two Control schools into 
one, but it was not necessary to delete the 19 pupils as 
in the randomized matched subjects design. The achieve- 
ment gain score (posttest-pretest) was computed for all 
pupils. Again the thesis in question could be answered by 
a set of hypotheses 
HH: The achievement gains for the experi- 

mental and control groups have the 

Same distribution. 

H,: The achievement gains for the experi- 
mental group are stochastically larger 
than those of the control group. 

There are no assumptions made as to the dastribution 
Of the achievement gains other than the distributions are 
@eeinuous and that themonuscrvangens unoreomp ete ly aaa 
pendent. Siegel [Ref. 15] states that the power efficiency 
Geethe Mann-Whitney U test approaches 95.5 per cent for 
large sample sizes compared to the Student-t. It is, there- 
£O0re, an excellent alternative to the Student-t, and it 


does not have the restrictive assumptions associated with 


the analysis of covariance. 








V1. JINDAL RES 


The set of hypotheses presented in each of the three 
different techniques of analysis were all designed to ans- 
wer the same question: Was the achievement level in math- 
ematics of the pupils in school A larger than the gain 
temmally attributed to the learning process during the sec. 
emo crade?’ If the null hypothesis was not accepted during 
the analysis it would be a strong indication that the change 
taemathematical programs at school A contributed signifi- 
@amucly to the gains made by the pupiis. If the opposite | 
Gcecurred, it could be concluded that the program had no 


Significant effect. 


mA OF FRR FRI FATS Oe ~~ a 


eeveistical analysis and the significance level of all 
tests was arbitrarily set at the 5 per cent level. This 
Significance level is the probability of a Type I error, 
@2¢., the probability that the null hypothesis is rejected 
when in reality it is true. The power of the tests, i.e., 
the probability of failing to reject the null hypothesis 
when the alternative is true, were not determined. 

All three statistical techniques discussed were used 
on the total test score and each of the subtest scores, 
ime. , Srasp of concepts and computational ability. The 
Peete results are presented in Tablle 11. The table includes 
the estimates of the mean and standard deviations for each 
eeepene three scores. The remaining results in Ehis seictiom 


apply only to the total scores. 
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In the randomized matched subject design, the estimate 
ef the mean of 94 difference Scoresmwas 974 pees aimee 
standard deviation was 48.7. These figures combined to 
yield t = 4.86 > 1.99. Hence the null hypothesis was re- 
feeted. 

The results of the analysis cf @evarianee are con. 
memed in Table ITT. F = 13.03 >°3.U00, hence the null hy 
Meenesis was rejected. The rejection of the null hypothe- 


mein the analysis of Covariance only indicates that wall 


SOURCE SS SP SS Ss! d.f. MS! 

x xy y y y 
TREATMENT b28.4 S984 S75600.9 30.165.2 2S anne 
ERROR 150,788.5 157,041.1 399,519.6 234.966.5 2030. masyes 
TOTAL 151,316.9 160,855.2 436,126.44 265,131.7 


iepLE Il]. ANALYSIS OF COVARIANCE, TOTAL SCORE. 


Preerans were not the same. <A technique exists to determine 
Pioben program or programs were significantly different. A 


ie ar combination of the form 


3 
BSE Glens ee elie » a 
dpe! 


where the fa are constants, -1syeal led eavcontrast.. Scie fica 
method [Ref. 15] was used to obtain confidence intervals for 
weaeral cOomtrasts and to determine which contrasts were sig- 
Maticantily different from zero. B., i = A, B, C are the 
meogram ecitfects of the different schools. The results are 


Goemcained 1n Table IV. 
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CONTRAST INTERVAL 


een 10.5 <L < 39.9 
Bee 10:0) Sie sy a0 
Ba Be 17 Se 
B,~%(BptBo) SS Se SSS 


fees IV. CONFIDENCE INTERVAL FOR CONTRASTS, TOTAL SCORE: 


It is obvious from Table IV that school A is the reason the 
Mol hypothesis was not accepted. The confidence intervals 
weocidated with school A do not contain zero and are greater 
man Zero. 

Picci Whitney sUeuCstny ie Ldede tie eSameete Sl) oieacmr 
Mitemmull hypothesis was rejected. The achievement gain 
mmemes yielded a U statistic of 6884.5 (correction for 
Mees Was included). This transformed into a normalized 
variate, z = BBR) The probability Of thilsseccurriamo. me 


Meeols < 0.05. 








VII. DISCUSSION OF THE RESULTS AND EURIDE RANA oI 


All three methods of analysis soundly rejected the 
mull hypothesis. Hence, £rom the preliminamw res Ube c eee 
mas concluded that the (division of elasses into (siamese. 
tions, composed of pupils of comparable ability, signifi- 
emtily increased the achievement gains during the second 
Meeade school year. 

me ENS pOIlNu tie aUchoarwdeclded te Gere mile wan meine 
paegrams were significantly different in both areas o£ the 
@rmatormia Achievement Test. The analysis of covariance 
Mas chosen as the technique for further analysis based on 
Mmmemeauchor's belief that 1t yielded the most information 
Meemeeie bo woe broades based Since tne contro: Schoo. 
femennot Combined. The results of the analysis of the con- 
Mee Scores are presented in Table V and computation scores 
Mmmrable VII. The contidence intervals for Contrasts are 


contained in Tables VI and VIII, respectively. 


SOURCE S oP Sto) So). di teense 
5 XY ue Mu Y 


TREATMENT 1,728.4 2,708.5 38,634.4 35,482.1 2 17,741.0 
ERROR LIS TES, NAS Pees <2 0), ILM ISS 


TOTAL 300,655.9 232,891.8 496,616.9 316,293.5 


iaABEE V. ANALYSIS OF COVARIANCE, CONCEPT SCORE. 


hoe COMmGCe pt Scores tie st atasat1c uh) — 7.85.2) 5.00pemence 
the null hypothesis was also rejected for conce»ts, and from 


tjmole VI it is clear that school A was the reason. 
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CONT RAST INTERVAL 


B,-Bp CS She eo ol 
BR-Be at ares) = IE < ZO 7. 
B,- %(B,t+B) 12 Oe a ones 


ieee VI. CONFIDENCE INTERVALS FOR CONTRASTS, CONCEPTS SCORE. 


SOURCE es SP Gc ea d.£. MS! 
X xy y y iy 


TREATMENT 50246 3.565.329 05740 25.948.9 2 aeonae 
ERROR onto? 5, 55020 5200072.5 250-525.5 2050 soem 


TOTAL Is 2A SAAS ES oe 7 ILS ee emo} 2) 0) EOD RA eye rarA 


feiss VII, ANALYSIS OF COVARIANCE, COMPUTATION SCORE. 


CONTRAST INTERVAL 
B,-Bp 9.8 <L < 40.1 
Bp- Bo =25.5 s ip < 9.4 
8,7 (B_tBe) 9.4 < lle < Susyed o 


ijeobe VIII. CONFIDENCE INTERVALS FOR CONTRAST, COMPUTATION 
SOs 
From the analysis presented, thus far, it is clear that 
[me program at school A allowed for sizeable achievement 
mans in all math areas of the CAT. A regression of the 
feast test scores on the pretest scores, total scores, was 
done to see how well the analysis of convariance assumptions 


more mete the results are contained in Table 1X. The 


Syl 








slopes of the regression equations were within reason for 
all of the schools, but the correlation was low for cero 
A®. Histograms were made for all three schools on the pre- 
test and posttest total scores. Ableappeared to besnom we, 
sustributed except the posttest scores o: School i eed 


modal distribution appeared, and further analysis was sug- 


gested. 
SCHOOL REGRESSION EQUATION CORRELATEON 
A Neel Ose2) ware Om.x 0.545 
B N= = 400 es 15 Xx 0.808 
ie Y sueetseoo + Ono Ost 


TABLE IX. REGRESSION ANALYSIS ON TOTAL SCORES. 


Ai IMVEeSticdtion was Desun to see if the apparent. oa. 
modal distribution of posttest scores of school A was as- 
sociated with the section assignments. At this time it was 
mescovered that complete details of final section assign- 
Memes were not available. The final assignments were dif- 
Bement from the original because of shifts during the school 
year. The reason for the shifts were explained in detail in 


Beetion Iii of this thesis. The assistant principal and the 


only available teacher concerned with the experiment assisted 


esis possibile ww tinoucn Statist lemeeteenn1 dues CO cece 
mre slopes tor equality and to obtain confidence intervals 
for the correlation. Equality of slopes was assumed in the 
Analysis of Covariance and the tests were not done to avoid 
debate about the significance level of the Analysis of Co- 
Mariance. 
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the author in verifying 79 (out sot 94) ot then a eaeeon 
ments. This information was not considered to be complete 
enough fOrnreturtie weaial cic 

A cluster analysis computer program was Obtained. a26m 
fieemeRac [Ref. 16] Couseparate thie, pupils nto elusien 
mased On their test scores. The program contained several 
Gimcerent methods to accompiish this. Two of the methods 
were used with identical results. The program separated 
@teepupils of school A into two clusters; basically, the 
elusters consisted of leveis I and II, labeled cluster A; 
meeeleveis Til, IV, and ¥, labeled cluster B. Since 79 
of the final assignments were Known, it was easy enough 
Mmemverity the soundness of the routines. Four errors 


ye of fut er Pig SS ee = ee He es = IP es a a = sees Pts ase Py dey es 
Dia Ta on ser £8 6 tered oe v\ ea & evoeen es oe = aT Vila int sta WY L hE fora ha « FN t hae 
WRN 4 Nw CHES: ewe ew Wee A4idA a Ww PMPsAS AY wan whew 2 WwW AN 34 ih oe we 9 we Ue Ly we “AAA weer KY OL 


A were assigned to B and one who was known to be in cluster 
Beas assigned to A. The author considered the number of 
known errors to be sufficiently small to avoid appreciable 
Peeserepancies in further analysis. | 
EMOotner Analysis of Covariance was wun on wehne Cotval 
Beeres. this time four groups were included in the analy- 
fee OChool A-cluster A, school A-cluster B, school B, 
eremschool C, The results are presented in Tables X and 
ole, 
iMemstatistilew: ="99 [299-7286 Hencerwthe null hyvpounesis 
was rejected. It is clear from Table XI that the scores from 


pamoot A-eluster A accounted for the rejection of the mull 


bye: 








hypothesis, Cluster A consisted ci these pupils asemenea 
to levels I and II where the level of instruction was the 


most advanced. 


SOURCE 95 oP 95 25! d 2 eiSe 
x XY Y A uh 


Me AIMENT 10,539.49 45.806 ..5 239, 3047 lesa 79977) es 52 400922 
ERROR L407 (ied 22048 9 Loos S225. 07 Oe 930.4 


TOTAL Poor. 100.855.272.456. 620 ie Oro , leila 


MepLE X. ANALYSIS OF COVARIANCE, TOTAL SCORE. SCHOOL # 
SEMA) INTO yO ee hus Lik Sr. 


CONTRAST INTERVAL 

Bap 8, 2155 le pO 
Bp Br =O Rae 19 
Ban 8p SHUN << DU ere 
Ban 8c yeas She SoS eal 
Ban Pap Com ccetlance o.5.00) 
Bp “Be Sa oe sy) <p Sn, 


ieee Xl. CONFIDENCE INTERVALS FOR CONTRAST, TOTAL SCORES. 
SUNOCO SEPARATED PNEO MI VOGie Sinks 
iitesscattem diagram tor Senmeol A 1Sesewm an Figures: 
Mites regression equations and correlations for the school as 
a whole and separated into clusters are snown on the figure. 
iitemecorrelation of the pupils in cluster B increased to 0.823, 
while the correlation of the pupils in eluster decreased ste 


Oei24. 
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POSTTEST SCORE 


FIGURE I. SCATTER DIAGRAM AND REGRESSION ARALYSIS, SCHOOL A. 


ze LEGEND: 
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Sy = 50.84 
S yy 742.62 


r-6.545 
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CLUSTER A ——-—- 
Y* 378.8 +0.097 X 
Syjy 719.28 


r=0.124 





530 360 


CLUSTER B ------ 
Y= 129.48 +0.659X 

Sy * 30.43 
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VIII. CONCLUSIONS 


In the Introduction it was) questionecdswacrnc peaeene. 
posed change in a mathematics program would lead to in- 
creased math achievement. The author contends that thru 
Statistical analysis this thesis established the increased 
achievement beyond a reasonable doubt. 

From the confidence intervals in Table XJ, it 1s clear 
MmoteenO Significant ditferences existed between cluster B 
eeschool A and schools B and C; the confidence intervals 
fomeenc appropriate Contrasts all contain zero. This re= 
avi, along with the information that cluster B atysehool A 
maa a Sample standard deviation of 30.4 on the posttest 
oeeme, led the author to conciude that no one was nurt by 
Miesprogram. It afforded the slow achievers the opportunity 
Memneceive concentrated instruction at thégr level. The 
miecethat the concentrated instruction at level V assisted 
those pupiis to advance faster was reflected by the low 
variability of cluster B's posttest scores compared with 
Semools B and C. 

As far as the author can ascertain from the analysis, 
the average student was unaffected by the program. It is 
readily apparent that those pupils in levels IJ and I1 re- 
Geived the greatest benefit. The means for levels I and 
ir; on the posttest score, was abowe the ggth DETeemtile 
@©f the nation based upon the standardization data of the 


CAT. On the average, cluster A missed less thun two questions 
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per child out of 11/7 on thempostees 1 la isec ie wee eee 
suit further suggests that cluster A could have scored even 
higher on an expanded test. Since the CAT was designed to 

be used through the fourth grade, the second graders are to 
be commended on their accomplishment. 

It may be questioned whether or not tne teachers at 
meme experimental school were concentrating their instruction 
ai the area of the CAT. The author talked to one of the 
teachers in the experimental school and to the principals 
ieee) three schools: He was assured that “teaching to 
the test” was not done at any of the schools. 

te 1s noted. )that sesimec the experimental teachime 


program was conceived and implemented by school A teachers, 


=} 


the so calied "“ilawtnorne effect" jRef. 17} may have been 
Operative. The results are quite striking, however, and 
the author feels that if such a program were administered 
by other teachers the improvement indicated here would not 
be diminished by any large degree. Finally, it is noted 
moat mone of the students at any of the schools were aware 
See ne experiment; only tne teachers. 

itis hoped that thas thesis 1s of some ald im keeping 
ee program at school A; including the extension of it into 
the other elementary grades. It is further hoped that it 


ea be used as a basis to implement the experimental pro- 


tram at other schools with the school district. 
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APPENDIX A: SCHOOL CHARACTERISTICS QUESTTONVATRE 


ilies How many students are enrolled in this school at each 
Ge the ol lowamo Jo cclemeltcns— eo. 


K i 

aa 3 
Z 9 
oa Ome 
4 oe 
sO 2 
6 4 


meeenbout what percentage of the students who attended this 
school last year are no longer attending this school (do 
not count those who have moved because of graduation or are 
beameg bussed to other schools)? 


0 
0 


eee How old is the main classroom building of this school 
mlant ? 


Vedas 0 Idi 


We oe! Weet Sercent cf Ehe families Gt st 


TY 
meoo! are represented at a typical meeting o 
Similar parent group? 


dents a 
f the Wak 


0 
i) 


Semevnich of the following categories best describe the nei- 
ghborhood served by this school? 


rural 

residential suburb 

INdUStT Tal SubUay Dp 

small town (5,000 or less) 

Git yVvOte or VOCE ORs OS OU) 

residential areasorea, larce ‘erty, 
(50,000+) 

C weetnne P pants Od ta eance weit (500 U0 


rh O eng 7a} 


6. About what percentage of students in this school have 
Hermers who are employed outside of the home? 


yy 
0 


7. From which of the following groups (check all that apply) 
is formal approval required to initiate new education programs 
in this school (e.g., team teaching, new curricila, ungraded 
classrooms, resource rooms, etc.)? 


58 








Board of Education 
SUpe TIN fendec mr 

DIStriet admin sera aron 
Other than’ superimeenden« 
Papremcs 

Teachers 

No formal approval needed 





8. About how long does it usually take to implement a new 
educational program in this school {i.e., from the time the 
Hecision 15 made» to, adepe at until che fame mites Seni, 
introduced) ? 


Moreen s 


9. (a) (Elementary Schools) What is the copyright date of the 
regular class reading book used in the third gerade az 
eMes School ¢ 


(se) (Junior and senzor hiek schools) sWhat 1s thescon . 
right date of the regular American history text used 
i Ciiceschoor? 


mee About what percentage of the students in this school are 
foal © ? 


ieeetiaat 1S the annual salary of the principal of this school? 
a 


imeewhat 1S the starting annual salary of a fully certified 
beginning teacher in this school system? 


$ 


moe elementary schools only) About what percentage of the 
Students now in Grade 1 in this school attended Kindergarten 
feits equivalent? 


Q 
0 


meeeADOUt What percentage of the students in this school are 
mina? In Memes in which there is only one parent? 


% 


ie DOUt What percentage of the students in this senool 
Speak a langugage other than English outside of school or 
come from homes in which a language other than English is 
spoken most of the time? 


6 


a0 








16. About what percentage of the pupils served by this 
school fall into each of the categories listed in the 
chart below (the total should equal 100%)? 


Occupational Category 6 
children of professionals and managers 

(doctors, lawyers, engineers, executives, 

etc) 

children of white collar workers other than 
these in (a) above (proprietors, salesman, 
eer Su. C&C.) 


children of skilled workers (electricians, 
Carpenters, repair men, factory workers, etc) 


children of unskilled workers (laborers, 
janitors, dishwashers, etc.) 


TOTAL 100% 


17. About how many catalogued volumes are there in the li- 
brary of this school? 


volumes 
Memeinact 1S the average full-time teaching experience of the 
@eaening staff of this school (consider counseling as teach- 
memo XPeT1ence } ? 


years 


19. What is the approximate average annual salary of the 
Pecenineg staff in this school’? 


$ 


ZU0= Please estimate in the chart below the number of hours per 
Meek that cach of the specified kinds of people are working 
mretnis school: 





iype OL person Penours per week 


Guidance Counselor 

Pevecnolonis t 

Child Welfare and 
Attendance Office 

Nurse 

Seeecn) (ne rapist 

Remedial Reading 
Specialise 

EMcs eSii=wecon c= 
Language Specialist 
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f hours per week 


Art Teacher 


Sex EduGatronueconstu tame 
\Patleige Goal ela 
Teacher Aides 
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APPENDIX B: SCHOOL DISTRICT MINIMUM MATH PROGKAM- > GRADE 2 


Students in the second erade 


-will be able to read numerals through 200 and write numerals 
~arough 100, and to cetint to 100@ny 1s and 108] ane te 
ZO by 5's 


-will gain a working knowledge of the concepts of numbers 0 
through 99 and be able to identify the number before and 
after a given number through 99. They will have some ex- 
Periences with numbers 100 through 999 including the idcen- 
irr cation of place value through the hundreds place. 


-will be exposed to number words through "ten." 


-will gain a working knowledge of the ordinal numerals first 
mirough seventh (using teacher-made materials) . 


-will be able to demonstrate an understanding that there are 
many ways to write the numeral for a given number. 


“will master the addition and subtraction combinations 
@mrough 15, with emphasi Ss on vertical notation, and will be 


able to complete CMe GQQGCLLION aie Str erecLeo:: sentences 
Within these number families. 


-will be able to demonstrate an understanding of and be able 
to add and subtract two-digit numbers which do not require 
Smrying or borrowing. 


-will be exposed to and have some practice with addition of 
moree-digit numbers with no Carrying. 


-will be exposed to and have some practice with addition 
problems involving carrying and with subtraction problems 
involving borrowing. 


-will be able to demcenstrate an understanding of, and be 
able to work addition problems with, three addends up to 
and including two-digit numbers. 


-will be able to demonstrate an understanding of the meaning 
of the mathematical vocabulary and symbols included in the 
minimum program for grade two. 


-will be able to demonstrate an understanding of and be able 


to work oral story problems using the concepts and skills 
mastered. 
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-will be able to demonstrate an understanding of the con- 
cepts of 1/2 and 1/4 and be exposed to 1/3 as part of a 
whole- 


-will be able to tell time to the nearest hour and half- 
Jaen we 


-will be able to use pennies, nickels, dimes, quarters, 
ana dollars. 


-will have further exposure to a calendar and will develop 


further understanding of a day, weck and month in measuring 
time . 
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APPENDIX ee ole Orr > 


SCHOOL A 

STUDENT PREP a Sm POS I TESt 

NUMBER CONCEPTS COMPUTATION TOMAL CONCEPTS COMPU TON Sie 
1 522 303 20) 555 Sis Sle 
2 5s 405 316 444 Bi & $5) Fi 
g 346 258 Je il 3 282 288 
4 520 Lo Zw 428 400 405 
5 ey be 260 265 464 400 426 
6 304 SAL fi Win 8 464 400 426 
fi 522 276 280 402 Oy S22 
8 Bee, 280 122)10, 505 309 S22 
9 Ge) 5) SAF OSS 428 385 5o7 
10 346 L135 PEs Ik 428 Gl & coat 
11 Soo 286 /L2) 428 535 Gish 
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